Gastric and Colon Cancer-associated Antigens 



The invention relates to isolated nucleic acid sequences which are expressed in cancers, 
especially gastrointestinal cancers, to their protein products and to the use of the nucleic 
acid and protein products for the identification and treatment of cancers. 

Cancers of the intestinal tract, such as gastric carcinomas and colorectal cancers, account 
for up to 15% of cancer-related deaths in the United States, and have low survival rates. 
Such cancers are often asymptomatic, the patient only becoming aware of them when the 
cancers have progressed too far to be successfully treated. There is therefore a need to 
identify new diagnostic tools and methods for treating such cancers. 

Identification of immunogenic proteins in cancer is essential for the development of 
immunotherapeutic strategies where adoptive immunity is directed towards MHC Class I- 
and Class II-associated peptides (Mians, et al., Cancer Immunology (2001), page 1). Many 
antigens are implicated in the aetiology and progression of cancer, and are associated with 
epigenetic events. Pre-clinical and clinical studies suggest that vaccination with, and 
targeting of, MHC-associated peptide antigens promotes tumour rejection (Ali S.A., et al., 
J. Immunol. (2002), Vol. 168(7), pages 3512-19 and Rees R.C., et al., Immunol. Immunother. 
(2002), Vol. 51(1), pages 58-61). 

The inventors have used a technique known as SEREX (Serological Identification of 
Antigens by Recombinant Expression Cloning) to identify genes which are over-expressed 
in cancer tissue. This technique was published by Sahin et al. (PNAS (USA), 1995, Vol. 
92, pages 11810-11813). SEREX uses total RNA isolated from tumour biopsies, from 
which poly(A)+ RNA is then isolated. cDNA is then produced using an oligo(dT) primer. 
The cDNA fragments produced are then cloned into a suitable expression vector, such as a 
bacteriophage, and introduced into a suitable host, such as E. coli. The clones produced are 
screened with high-titer IgG antibodies in autologous patient serum, to identify antigens 
associated with the tumour. 

Several SEREX-defined antigens have provided attractive candidates for the construction 
of cancer vaccines, for example NY-ESO-1 from testis (Chen Y.T., et al. (1997), Vol. 94, 
page 1914; Stockert E., et al., J. Exp. Med. (1998), Vol. 187, page 1349; Jager E., et al., 
PNAS (2000), Vol. 97, page 12198; and Jager E., et al., PNAS (2000), Vol. 97, page 
4760). Mutated p53 (Scanlan M.J., et al., Int. J. Cancer (1998), Vol. 76, page 652), 
putative tumour suppressor ING1 (Jager D., et al., Cancer Res. (1999), Vol. 59, page 6416) 
and adhesion molecule galectin 9 (Tureci O., et al., J. Biol. Chem. (1997), Vol. 272, page 
6416), for example, have also been detected by SEREX, showing that the analysis of 
autoantibodies can identify genes involved in cancer aetiology and identify diagnostic 
markers or indicators of disease progression. 

The inventors have used this technique to identify genes and gene products associated with 
gastric cancer. 

A first aspect of the invention provides an isolated mammalian nucleic acid molecule 
comprising a sequence selected from SEQ.ID.1 and SEQ.ID.2. Preferably the isolated 
nucleic acid molecule encodes a mammalian antigen which is expressed in higher than 
normal concentrations in cancer cells, compared with normal non-cancerous cells. 
Preferably the cancer is a gastro-intestinal cancer. The term "higher than normal 
concentrations" preferably means that the protein is expressed at a concentration at least 5 
times greater in tumour cells than normal cells. 

Preferably the nucleic acid molecule encodes TACC1-D (SEQ.ID.1). 



TACC1 splice variant (TACC1-D); full-length mRNA 

>acagccgcccgccgcccagcacaggagggtgcagccccggccccaagttctgcgccatgggaggctcccactctcagacccc 

gaggggccgggaacccgccggggagaggcacccgagacccacggagaccgcgactactgaacaagtgaaatttctctgttttct 

gttgagtggctgtaaggtgaagaagcatgaaactcagtctctcgccctggatgcatgttctcgggatgaaggggcagtgatctccca 

gatttcagacatttctaatagggatggccatgctactgatgaggagaaactggcatccacgtcatgtggtcagaaatcagctggtgcc 

gaggtgaaaggtgagccagaggaagacctggagtactttgaatgttccaatgttcctgtgtctaccataaatcatgcgttttcatcctca 

gaagcaggcatagagaaggagacgtgccagaagatggaagaagacgggtccactgtgcttgggctgctggagtcctctgcagag 




aaggcccctgtgtcggtgtcctgtggaggtgagagccccctggatgggatctgcctcagcgaatcagacaagacagccgtgctca 
ccttaataagagaagagataattactaaagagattgaagcaaatgaatggaagaagaaatacgaagagacxcggc^ 
gagatgaggaaaattgtagctgaatatgaaaagactattgctcaaatgattgaagatgaacaaaggacaagtatgacctctcagaag 
agcttccagcaactgaccatggagaaggaacaggccctggctgaccttaaxjtctgtggaaaggtocc^ctgatct^ 



acaagaggagcagcgataccaggccctgaaaatccacgcagaaga^aactggacaaagccaatgaagagattgcteaggttcg 
aacaaaagcaaaggctgagagtgcagctctccatgctggactccgcaaagagcagatgaaggtggagtocctggaaagggccct 
gcagcagaagaaccaagaaattgaagaactgacaaaaatctgtgatgagctgattgcaaagctgggaaagactgactgagacact 
oiccctgttagctcaacagatcrtgcamggctgcttctcttgtgaccacaattatcttgccttatccaggaataattg 

Pfl qaaaagflaar: t t ?''"'»"a"g''a^-«tgr^tactgctgCCtgtoCCgCfflgCtgCCaatgCaacagC^^ 

.ggttgcatagtctagaaaggagtgtgacctgacagtgctggagcctcctagtttcc(x;ctatgaaggtt 

ggmgtgatttatctttagtttgttttaaagtcatctttactttcccaaatgtgt^^ 

ctgattHtttgtgatctgtttaatottttaatttt^ 

agaaggggctctggatccccttttaaattacacacactcrtcacacacatacatgtatgtttatagatgrtgctg 
agtcaagtaagaactgctctacagaaggacatatttc(rttggatgtgagaccctattttgaaatagagt^^ 
aagaamgggggattaaagatgtgaagaccacagtcttgggttttcatatctggagaagactatttgcc^^ 
atttggacactcctcagctttaatgggtgtgg(xcctttagggttagtcctcagactaatgatagtgtctgcttt^^ 
atgggactccctccaagctagggtttggcaagtctgccctagagtcalttactctcctctgcctcxatttgttaate^^ 
agtottcattatcttttttttttttWgagacagagmcgatctattt^ 
aafloataflttatflcrcacattaccaa1tataaget2aaeaaatEttttmc(xa^ 



aaaaaataccrttctaacttaagacagaatttttaacaaaatgagcagtaaaagtcacatgaacc^c^^ 

atttttaaacaaagacagcttgttgaatactgagaagaggagtgcaaggagaaggtctgtactaacaaagccaaattccte 

actggactcagttcagagtggtgggccattaaccccaacatggaatttttccatataaatct^^ 

aacccaaatccatgcaagtgttttaaagcactgtcctgtcttaatettacatgctgaaag^cttcat^^ 

cgtatgttttcctacttctcttgtaaaactgttgcatgatMaacttcagcaatgaattgtgc^^^ 

atgaattattctttagcagtgtattactcacatgggtgcaatctttagccccagggaggtcaataatgtcttttaaag 

ttaccaatatgcatttatcataattggtgcttaggctgtatatfcaagcxtgttgtcttaaca^ 

tgtcatttgagaagtggcttgacaMcalttgagctttgaaagcagtcactgtggtgtaatatgaatgctgtcct^ 

agggcacgtgtotccccttggtataactgatttixtttttagtcctctactgc^ 

tgctaaatctltttgctgctgtgttttggtgttttcatgttta(rttgttttatattgat^ 

aaatccatagtcatctttttaagcttattgtgtttaagaaagtagctatgt^^ 
ctggccrtctcagccatgaccgttatgaggaaatataxxcattcgaacttaacagatgcctcc^^^ 

ttgtacagatcaagagaatatactgggcagaatgaagtatgmgtttatttttctttaaaaataaaggattt^ 

aatatagtata^gtttgcctcaacacatgtgagggccaaataacctgctagctaggcagtaataaactrt 

gggccgggcacagtggcttattcctgtaatcccaacactgtggaaggccgaggcaggaggatcacttga^ 

ctacxtaggcaacatggtgaaaccttgtctctaccaaaataaaaattagctgggcatggtggcacgtgcrtgt^^ 

ggaggctgaggtgggagcctgggaggtcaaggctgcagtgagccatgatcatgcxactgcactccatcctgggtgacagcaa^^ 

cttGtf'trnaaaaafiP^'' '^^a^^^aaat^aggagtgaaaaaggaaaetaaaaggcagctgctggcctaaat^^ 

ggaatattaggtgatcctgttgaaattctggatccaaagcaatttctttagctWgactttgccaaagtgtaaatagccm 

tttttaagggg^aalgcaacgggaggccaactgaacaattccccccgtggctgcccagatagtcacagtcaaggtt^ 

ccttccagccagggacctacccaaacctffigttctgtaaaactgctctggaaataccgggaagcccagttttctc^ 

ttcttcggactcagcccaatttaggagtgccgaagcacatgatgg// 

Transforming acidic coiled-coil (TACC) proteins are centrosome and 
microtubule-associated proteins that are essential for mitotic spindle production (Gergely 
F., et al., PNAS (USA) (2000), Vol. 97, pages 14352-57; Gergely F., et al., EMBO J. 
(2000), Vol. 19, pages 241-252; and Lee M.J., et al., Nat. Cell Biol., Vol. 3, pages 
543-649). When over-expressed in mouse fibroblasts, TACC-1 results in cellular 
transformation and anchorage-independent growth (Still I.H., et al., Oncogene (1999), Vol. 
18, pages 4032-4038). High levels of TACC-3 mRNA have been found in various cancer 
cell lines (Still I.H., Genomics (1999), Vol. 58, pages 165-170), but TACC-2 (AZU-1) has 
been identified as a potential breast tumour suppressor and is downregulated in breast 
carcinoma cell lines (Chen H.M., Mol. Biol. Cell (2000), Vol. 11, pages 1357-1367). 

TACC-1 has now been identified as an immunogenic protein and a potential tumour 
antigen. 5'RLM-RACE and RT-PCR analysis identified a transcript variant, designated 
TACC1-D, as being relatively strongly expressed in 50% of gastric tissue samples analysed. 
The variant is only weakly detectable in normal kidney and colon tissues and is not 
detectable in other normal tissues. 



Five other TACC-1 splice variants have also been found (TACC1-A, TACC1-B, 
TACC1-C, TACC1-E and TACC1-F). TACC1-A, TACC1-B, TACC1-C and TACC1-E 
were expressed universally in all normal tissues tested. TACC1-F was expressed in brain 
and gastric tumours to a similar level. 

Preferably the isolated nucleic acid sequence encodes AD034 (SEQ.ID.2). 



AD034 mRNA sequence 

1 gggtggtgga tctgtcggtc ccgttttccc gtcgcacgtg gtggccactg ttggcttctg 
61 aatggtttgc aaggcggata tccacgccaa ggcctttgga tcggccgtgg gtacatccgt 
121 ctgagccgtt cctttccatc gcagagcggc ggcctccggc ggcgctctcc agtcatggac 
181 taccggcggc ttctcatgag ccgggtggtc cccgggcaat tcgacgacgc ggactcctct 
241 gacagtgaaa acagagactt gaagacagtc aaagagaagg atgacattct gtttgaagac 
301 cttcaagaca atgtgaatga gaatggtgaa ggtgaaatag aagatgagga ggaggagggt 
361 tatgatgatg atgatgatga ctgggactgg gatgaaggag ttggaaaact cgccaagggt 
421 tatgtctgga atggaggaag caacccacag gcaaatcgac agacctccga cagcagttca 
481 gccaaaatgt ctactccagc agacaaggtc ttacggaaat ttgagaataa aattaattta 
541 gataagctaa atgttactga ttccgtcata aataaagtca ccgaaaagtc tagacaaaag 
601 gaagcagata tgtatcgcat caaagataag gcagacagag caactgtaga acaggtgttg 
661 gatcccagaa caagaatgat tttattcaag atgttgacta gaggaatcat aacagagata 
721 aatggctgca ttagcacagg aaaagaagct aatgtatacc atgctagcac agcaaatgga 
781 gagagcagag caatcaaaat ttataaaact tctattttgg tgttcaaaga tcgggataaa 
841 tatgtaagtg gagaattcag atttcgtcat ggctattgta aaggaaaccc taggaaaatg 
901 gtgaaaactt gggcagaaaa agaaatgagg aacttaatca ggctaaacac agcagagata 
961 ccatgtccag aaccaataat gctaagaagt catgttcttg tcatgagttt catcggtaaa 
1021 gatgacatgc ctgcaccact cttgaaaaat gtccagttat cagaatccaa ggctcgggag 
1081 ttgtacctgc aggtcattca gtacatgaga agaatgtatc aggatgccag acttgtccat 
1141 gcagatctca gtgaatttaa catgctgtac cacggtggag gcgtgtatat cattgacgtg 
1201 tctcagtccg tggagcacga ccacccacat gccttggagt tcttgagaaa ggattgcgcc 
1261 aacgtcaatg atttctttat gaggcacagt gttgctgtca tgactgtgcg ggagctcttt 
1321 gaatttgtca cagatccatc cattacacat gagaacatgg atgcttatct ctcaaaggcc 
1381 atggaaatag catctcaaag gaccaaggaa gaacggtcta gccaagatca tgtggatgaa 
1441 gaggtgttta agcgagcata tattcctaga accttgaatg aagtgaaaaa ttatgagagg 
1501 gatatggaca taattatgaa attgaaggaa gaggacatgg ccatgaatgc ccaacaagat 
1561 aatattctat accagactgt tacaggattg aagaaagatt tgtcaggagt tcagaaggtc 
1621 cctgcactcc tagaaaatca agtggaggaa aggacttgtt ctgattcaga agatattgga 
1681 agctctgagt gctctgacac agactctgaa gagcagggag accatgcccg ccccaagaaa 
1741 cacaccacgg accctgacat tgataaaaaa gaaagaaaaa agatggtcaa ggaagcccag 
1801 agagagaaaa gaaaaaacaa aattcctaaa catgtgaaaa aaagaaagga gaagacagcc 
1861 aagacgaaaa aaggcaaata gaatgagaac catattatgt acagtcattt tcctcagttc 
1921 cttttctcgc ctgaactctt aagctgcatc tggaagatgg cttattggtt ttaaccagat 
1981 tgtcatcgtg gcactgtctg tgaagacgga ttcaaatgtt ttcatgtaac tatgtaaaaa 
2041 gctctaagct ctagagtcta gatccagtca ctgactctgt ctggtgttga cagaggattt 
2101 atttaagcta ttattttaat aaagaacttt gtacattttt atttttatat ttttttctct 
2161 tacaaatatg tttttggaag catgataaat gtttaaatgt agtcaacatc tgtaactctt 
2221 acatgagtgt ccagaggcac tcatgggaaa attggttttg ctttctttgt acacaccaga 
2281 gacccatctg aggtcatctg attataaggc catgtttata taaagggaat ttcacccaca 
2341 gttcagctgg ctgttgattt tcactgcaac tctgcctttg tgtgtattgg cgatcatttg 
2401 taatgctctt acacttcgtc tttaatgttc tttttggagt taggacctct cagttcataa 
2461 agttttttac aattcaaaaa aaaaaaaaaa aaaaa 



AD034 encodes a tyrosine kinase motif and has similarity to the RIO1/ZK632.3/MJ0444 
family. RT-PCR showed that the protein contains a 32-bp frame shift mutation which is 
not associated with the increased levels observed in colorectal cancer patients. The 32-bp 
sequence is a minor mRNA variant and is detectable in normal tissues where AD034 is 
expressed, and no significant differences in the ratios of either isoform were observed 
between colorectal tumours and adjacent normal tissues. 



cDNA sequence of AD034 with 32bp insertion (SEQ. ID 3) 



1 gggtggtgga tctgtcggtc ccgttttccc gtcgcacgtg gtggccactg ttggcttctg 
61 aatggtttgc aaggcggata tccacgccaa ggcctttgga tcggccgtgg gtacatccgt 
121 ctgagccgtt cctttccatc gcagagcggc ggcctccggc ggcgctctcc agtcatggac 
181 taccggcggc ttctcatgag ccgggtggtc cccgggcaat tcgacgacgc ggactcctct 
241 gacagtgaaa acagagactt gaagacagtc aaagagaagg atgacattct gtttgaagac 
301 cttcaagaca atgtgaatga gaatggtgaa ggtgaaatag aagatgagga ggaggagggt 
361 tatgatgatg atgatgatga ctgggactgg gatgaaggag ttggaaaact cgccaagggt 
421 tatgtctgga atggaggaag caacccacagCTAGTGCCTTAGACTCTGGAATTCCCTTCTAG 
gcaaatcgac agacctccga cagcagttca 

481 gccaaaatgt ctactccagc agacaaggtc ttacggaaat ttgagaataa aattaattta 
541 gataagctaa atgttactga ttccgtcata aataaagtca ccgaaaagtc tagacaaaag 
601 gaagcagata tgtatcgcat caaagataag gcagacagag caactgtaga acaggtgttg 
661 gatcccagaa caagaatgat tttattcaag atgttgacta gaggaatcat aacagagata 
721 aatggctgca ttagcacagg aaaagaagct aatgtatacc atgctagcac agcaaatgga 
781 gagagcagag caatcaaaat ttataaaact tctattttgg tgttcaaaga tcgggataaa 
841 tatgtaagtg gagaattcag atttcgtcat ggctattgta aaggaaaccc taggaaaatg 
901 gtgaaaactt gggcagaaaa agaaatgagg aacttaatca ggctaaacac agcagagata 
961 ccatgtccag aaccaataat gctaagaagt catgttcttg tcatgagttt catcggtaaa 
1021 gatgacatgc ctgcaccact cttgaaaaat gtccagttat cagaatccaa ggctcgggag 
1081 ttgtacctgc aggtcattca gtacatgaga agaatgtatc aggatgccag acttgtccat 
1141 gcagatctca gtgaatttaa catgctgtac cacggtggag gcgtgtatat cattgacgtg 
1201 tctcagtccg tggagcacga ccacccacat gccttggagt tcttgagaaa ggattgcgcc 
1261 aacgtcaatg atttctttat gaggcacagt gttgctgtca tgactgtgcg ggagctcttt 
1321 gaatttgtca cagatccatc cattacacat gagaacatgg atgcttatct ctcaaaggcc 
1381 atggaaatag catctcaaag gaccaaggaa gaacggtcta gccaagatca tgtggatgaa 
1441 gaggtgttta agcgagcata tattcctaga accttgaatg aagtgaaaaa ttatgagagg 
1501 gatatggaca taattatgaa attgaaggaa gaggacatgg ccatgaatgc ccaacaagat 
1561 aatattctat accagactgt tacaggattg aagaaagatt tgtcaggagt tcagaaggtc 
1621 cctgcactcc tagaaaatca agtggaggaa aggacttgtt ctgattcaga agatattgga 
1681 agctctgagt gctctgacac agactctgaa gagcagggag accatgcccg ccccaagaaa 
1741 cacaccacgg accctgacat tgataaaaaa gaaagaaaaa agatggtcaa ggaagcccag 
1801 agagagaaaa gaaaaaacaa aattcctaaa catgtgaaaa aaagaaagga gaagacagcc 
1861 aagacgaaaa aaggcaaata gaatgagaac catattatgt acagtcattt tcctcagttc 
1921 cttttctcgc ctgaactctt aagctgcatc tggaagatgg cttattggtt ttaaccagat 
1981 tgtcatcgtg gcactgtctg tgaagacgga ttcaaatgtt ttcatgtaac tatgtaaaaa 
2041 gctctaagct ctagagtcta gatccagtca ctgactctgt ctggtgttga cagaggattt 
2101 atttaagcta ttattttaat aaagaacttt gtacattttt atttttatat ttttttctct 
2161 tacaaatatg tttttggaag catgataaat gtttaaatgt agtcaacatc tgtaactctt 
2221 acatgagtgt ccagaggcac tcatgggaaa attggttttg ctttctttgt acacaccaga 
2281 gacccatctg aggtcatctg attataaggc catgtttata taaagggaat ttcacccaca 
2341 gttcagctgg ctgttgattt tcactgcaac tctgcctttg tgtgtattgg cgatcatttg 
2401 taatgctctt acacttcgtc tttaatgttc tttttggagt taggacctct cagttcataa 
2461 agttttttac aattcaaaaa aaaaaaaaaa aaaaa 



The insertion is shown in upper case letters. 
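
The upper-case convention lends itself to a simple check. The following is a minimal sketch, not part of the described method, which locates upper-case runs in a mixed-case listing and tests whether their length disturbs the reading frame. The function names are hypothetical, and the fragment is copied from the line numbered 421 of the listing above.

```python
import re


def find_insertions(seq: str):
    """Return (start, length, text) for each upper-case run in a mixed-case sequence."""
    # Drop position numbers, spaces and line breaks so offsets refer to bases only.
    cleaned = re.sub(r"[^A-Za-z]", "", seq)
    return [(m.start(), len(m.group()), m.group())
            for m in re.finditer(r"[A-Z]+", cleaned)]


def shifts_frame(length: int) -> bool:
    """An insertion shifts the reading frame unless its length is a multiple of 3."""
    return length % 3 != 0


# Fragment spanning the insertion site (lower case = SEQ.ID.2, upper case = insertion).
fragment = "caacccacagCTAGTGCCTTAGACTCTGGAATTCCCTTCTAGgcaaatcgac"

insertions = find_insertions(fragment)
for start, length, text in insertions:
    print(f"offset {start}, {length} bp, frame shift: {shifts_frame(length)}")
```

Because 32 is not a multiple of three, an insertion of this length is consistent with the frame shift described for the variant.
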

Fragments of the nucleic acid molecules which encode antigenic determinants unique to 
each protein are also included. 

Preferably such determinants are specific for TACC1-D and do not cross-react with, e.g. 
TACC1-A, TACC1-B, TACC1-C, TACC1-E or TACC1-F. 

Preferably the determinants are specific for AD034 with or without its insertion. 

Nucleic acid molecules having at least 60%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% 
homology to the nucleic acid molecules are also provided. Preferably these have 
TACC1-D activity or AD034 activity. 

The invention also includes, within its scope, nucleic acid molecules complementary to 
such isolated mammalian nucleic acid molecules. 

The nucleic acid molecules of the invention may be DNA, cDNA or RNA. In RNA 
molecules "T" (thymine) residues may be replaced by "U" (uracil) residues. 
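
The relationship between a DNA sequence, its complementary strand and the corresponding RNA can be illustrated with a short sketch. The helper names below are hypothetical and chosen only for illustration; the input is the first ten nucleotides of the AD034 mRNA listing.

```python
# Translation table mapping each base to its Watson-Crick partner.
DNA_COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return dna.translate(DNA_COMPLEMENT)[::-1]


def to_rna(dna: str) -> str:
    """Replace thymine (T) with uracil (U), preserving case."""
    return dna.replace("t", "u").replace("T", "U")


first_ten = "gggtggtgga"  # first ten nucleotides of the AD034 listing
print(reverse_complement(first_ten))
print(to_rna(first_ten))
```
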

Preferably, the isolated mammalian nucleic acid molecule is an isolated human nucleic acid 
molecule. 

The invention further provides nucleic acid molecules comprising at least 15 nucleotides 
capable of specifically hybridising to a sequence included within the sequence of a nucleic 
acid molecule according to the first aspect of the invention. The hybridising nucleic acid 
molecule may either be DNA or RNA. Preferably the molecule is at least 90%, at least 
92%, at least 94%, at least 96%, at least 98%, at least 99%, homologous to the nucleic acid 
molecule according to the first aspect of the invention. This may be determined by 
techniques known in the art. 

The term "specifically hybridising" is intended to mean that the nucleic acid molecule can 
hybridise to nucleic acid molecules according to the invention under conditions of high 
stringency. Typical conditions for high stringency include 0.1 x SET, 0.1% SDS at 68°C 
for 20 minutes. 



The invention also encompasses variant DNAs and cDNAs which differ from the 
sequences identified above, but encode the same amino acid sequences as the isolated 
mammalian nucleic acid molecules, by virtue of redundancy in the genetic code. 



First                      Second position                       Third
position      U            C            A            G          position

   U       UUU Phe      UCU Ser      UAU Tyr      UGU Cys          U
           UUC Phe      UCC Ser      UAC Tyr      UGC Cys          C
           UUA Leu      UCA Ser      UAA* Stop    UGA* Stop        A
           UUG Leu      UCG Ser      UAG* Stop    UGG Trp          G

   C       CUU Leu      CCU Pro      CAU His      CGU Arg          U
           CUC Leu      CCC Pro      CAC His      CGC Arg          C
           CUA Leu      CCA Pro      CAA Gln      CGA Arg          A
           CUG Leu      CCG Pro      CAG Gln      CGG Arg          G

   A       AUU Ile      ACU Thr      AAU Asn      AGU Ser          U
           AUC Ile      ACC Thr      AAC Asn      AGC Ser          C
           AUA Ile      ACA Thr      AAA Lys      AGA Arg          A
           AUG** Met    ACG Thr      AAG Lys      AGG Arg          G

   G       GUU Val      GCU Ala      GAU Asp      GGU Gly          U
           GUC Val      GCC Ala      GAC Asp      GGC Gly          C
           GUA Val      GCA Ala      GAA Glu      GGA Gly          A
           GUG** Val    GCG Ala      GAG Glu      GGG Gly          G


* Chain-terminating, or "nonsense" codons. 

** Also used to specify the initiator formyl-Met-tRNAMet. The Val triplet GUG is 
therefore "ambiguous" in that it codes for both valine and methionine. 

The genetic code showing mRNA triplets and the amino acids for which they code 
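
The redundancy of the code can be made concrete with a short sketch. The code below builds the standard codon table in the same U, C, A, G order as the table above and shows that two different DNA sequences can encode the same peptide. The first sequence is the start of the AD034 open reading frame taken from the mRNA listing; the second is a hypothetical synonymous variant constructed purely for illustration, and the function names are likewise assumptions.

```python
from itertools import product

BASES = "UCAG"
# One-letter amino acids in table order (first base slowest, third base fastest); "*" = stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(nucleic_acid: str) -> str:
    """Translate DNA or RNA in frame 1, stopping at the first stop codon."""
    rna = nucleic_acid.upper().replace("T", "U")
    peptide = []
    for i in range(0, len(rna) - 2, 3):
        residue = CODON_TABLE[rna[i:i + 3]]
        if residue == "*":
            break
        peptide.append(residue)
    return "".join(peptide)


def synonymous_codons(residue: str):
    """List every codon that encodes the given one-letter amino acid."""
    return sorted(codon for codon, aa in CODON_TABLE.items() if aa == residue)


native = "atgagccgggtggtccccgggcaattcgacgac"   # from the AD034 mRNA listing
variant = "atgtctagagttgtaccaggacagtttgatgat"  # hypothetical synonymous variant

print(translate(native))
print(translate(variant))
print(synonymous_codons("L"))
```

Both sequences translate to the same eleven residues although they differ at many nucleotide positions, which is the sense in which variant DNAs may encode the same amino acid sequence.
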

The invention also includes within its scope vectors comprising a nucleic acid according to 
the invention. Such vectors include bacteriophages, phagemids, cosmids and plasmids. 
Preferably the vectors comprise suitable regulatory sequences, such as promoters and 
termination sequences, which enable the nucleic acid to be expressed upon insertion into a 
suitable host. Accordingly, the invention also includes hosts comprising such a vector. 
Preferably the host is E. coli. 

A second aspect of the invention provides an isolated polypeptide obtainable from a 
nucleic acid sequence according to the invention. As indicated above, the genetic code for 
translating a nucleic acid sequence into an amino acid sequence is well known. 

Preferably the sequence is: 

AD034 peptide sequence 

/translation="MSRVVPGQFDDADSSDSENRDLKTVKEKDDILFEDLQDNVNENG 
EGEIEDEEEEGYDDDDDDWDWDEGVGKLAKGYVWNGGSNPQANRQTSDSSSAKMSTPA 
DKVLRKFENKINLDKLNVTDSVINKVTEKSRQKEADMYRIKDKADRATVEQVLDPRTR 
MILFKMLTRGIITEINGCISTGKEANVYHASTANGESRAIKIYKTSILVFKDRDKYVS 
GEFRFRHGYCKGNPRKMVKTWAEKEMRNLIRLNTAEIPCPEPIMLRSHVLVMSFIGKD 
DMPAPLLKNVQLSESKARELYLQVIQYMRRMYQDARLVHADLSEFNMLYHGGGVYIID 
VSQSVEHDHPHALEFLRKDCANVNDFFMRHSVAVMTVRELFEFVTDPSITHENMDAYL 
SKAMEIASQRTKEERSSQDHVDEEVFKRAYIPRTLNEVKNYERDMDIIMKLKEEDMAM 
NAQQDNILYQTVTGLKKDLSGVQKVPALLENQVEERTCSDSEDIGSSECSDTDSEEQG 
DHARPKKHTTDPDIDKKERKKMVKEAQREKRKNKIPKHVKKRKEKTAKTKKGK" 

The invention further provides polypeptide analogues, fragments or derivatives of antigenic 
polypeptides which differ from naturally-occurring forms in terms of the identity or 
location of one or more amino acid residues (deletion analogues containing less than all of 
the residues specified for the protein, substitution analogues wherein one or more residues 
specified are replaced by other residues, and addition analogues wherein one or more amino 
acid residues are added to a terminal or medial portion of the polypeptides) and which 
share some or all properties of the naturally-occurring forms. Preferably such polypeptides 
comprise between 1 and 20, preferably between 1 and 10, amino acid deletions or substitutions. 

Preferably the polypeptide is at least 95%, 96%, 97%, 98% or 99% identical to the 
sequences of the invention. This can be determined conventionally using known computer 
programs such as the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 
for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, 
Madison, WI 53711). When using Bestfit or any other sequence alignment program to 
determine whether a particular sequence is, for instance, 95% identical to a reference 
sequence according to the present invention, the parameters are set, of course, such that the 
percentage of identity is calculated over the full length of the reference amino acid 
sequence and that gaps in homology of up to 5% of the total number of amino acid residues 
in the reference sequence are allowed. 
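
The calculation described above can be sketched as follows. This is not the Bestfit algorithm: it assumes the two sequences have already been aligned by some program, with "-" marking gaps, and simply applies the two stated conditions (identity over the full length of the reference, and gaps limited to 5% of the reference residues). The function name and the example query are hypothetical; the reference is the first 40 residues of the second line of the AD034 peptide listing.

```python
def identity_over_reference(aligned_ref: str, aligned_query: str,
                            max_gap_fraction: float = 0.05):
    """Return (percent identity over the reference length, gap allowance respected?)."""
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have the same length")
    pairs = list(zip(aligned_ref, aligned_query))
    # Full length of the reference, ignoring gap characters introduced by alignment.
    ref_length = sum(1 for r, _ in pairs if r != "-")
    matches = sum(1 for r, q in pairs if r == q and r != "-")
    gaps = sum(1 for r, q in pairs if r == "-" or q == "-")
    percent_identity = 100.0 * matches / ref_length
    return percent_identity, gaps / ref_length <= max_gap_fraction


reference = "EGEIEDEEEEGYDDDDDDWDWDEGVGKLAKGYVWNGGSNP"
# Hypothetical query: one substitution (E to A) and one single-residue deletion.
query = "EGEIADEEEEGYDDDDDDW-WDEGVGKLAKGYVWNGGSNP"

percent, gaps_ok = identity_over_reference(reference, query)
print(f"{percent:.1f}% identical, gap allowance respected: {gaps_ok}")
```

Dividing by the reference length rather than by the alignment length is the design choice that matches the requirement that identity be calculated over the full length of the reference sequence.
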

The nucleic acids and polypeptides of the invention are preferably identifiable using the 
SEREX method. However, alternative methods, known in the art, may be used to identify 
nucleic acids and polypeptides of the invention. These include differential display PCR 
(DD-PCR), representational difference analysis (RDA) and suppression subtractive 
hybridisation (SSH). 

All of the nucleic acid molecules according to the invention and the polypeptides which 
they encode are detectable by SEREX (discussed below). The technique uses serum 
antibodies from cancer patients to identify the molecules. It is therefore the case that the 
gene products identified by SEREX are able to evoke an immune response in a patient and 
may be considered as antigens suitable for potentiating further immune reactivity if used as 
a vaccine. 

The third aspect of the invention provides the use of nucleic acids or polypeptides 
according to the invention, to detect or monitor cancers, preferably gastro-intestinal 
cancers, such as gastric cancer or colorectal cancer. 

The use of a nucleic acid molecule hybridisable under high stringency conditions to a nucleic 
acid according to the first aspect of the invention, to detect or monitor cancers, e.g. 
gastro-intestinal cancers, such as gastric cancer or colorectal cancer, is also encompassed. 
Such molecules may be used as probes, e.g. using PCR. 

The expression of genes, and detection of their polypeptide products, may be used to 
monitor disease progression during therapy or as a prognostic indicator of the initial 
disease status of the patient. There are a number of techniques which may be used to detect 
the presence of a gene, including the use of Northern blot and reverse transcription 
polymerase chain reaction (RT-PCR), which may be used on tissue or whole blood samples 
to detect the presence of cancer-associated genes. For polypeptide sequences, in-situ 
staining techniques, enzyme-linked immunosorbent assays (ELISA) or radio-immune assays 
may be used. RT-PCR based techniques result in the amplification of messenger RNA of the 
gene of interest (Sambrook, Fritsch and Maniatis, Molecular Cloning, A Laboratory Manual, 
2nd Edition). ELISA-based assays necessitate the use of antibodies raised against the 
protein or peptide sequence and may be used for the detection of antigen in tissue or serum 
samples (McIntyre C.A., Rees R.C., et al., Europ. J. Cancer 28, 58-631 (1990)). In-situ 
detection of antigen in tissue sections also relies on the use of antibodies, for example, 
immunoperoxidase staining or alkaline phosphatase staining (Graepel J.R., Rees R.C., et al., 
Brit. J. Cancer 64, 880-883 (1991)), to demonstrate expression. Similarly, radio-immune 
assays may be developed whereby antibody conjugated to a radioactive isotope, such as a 
radioisotope of iodine, is used to detect antigen in the blood. 

Blood or tissue samples may be assayed for elevated concentrations of the nucleic acid 
molecules or polypeptides. 

Methods of producing antibodies which are specific to the polypeptides of the invention, 
for example, by the method of Kohler & Milstein to produce monoclonal antibodies, are 
well known. A further aspect of the invention provides an antibody which specifically 
binds to a polypeptide according to the invention. 



Preferably, for example, the antibody binds TACC1-D, and not TACC1-A, TACC1-B, 
TACC1-C, TACC1-E or TACC1-F. 

Kits for detecting or monitoring cancer, such as gastro-intestinal cancers, including gastric 
cancer and/or colorectal cancer, using polypeptides, nucleic acids or antibodies according 
to the invention are also provided. Such kits may additionally contain instructions and 
reagents to carry out the detection or monitoring. 

The fourth aspect of the invention provides for the use of nucleic acid molecules according 
to the first aspect of the invention or polypeptide molecules according to the second aspect 
of the invention in the prophylaxis or treatment of cancer, or pharmaceutically effective 
fragments thereof. By pharmaceutically effective fragment, the inventors mean a fragment 
of the molecule which still retains the ability to act as a prophylactic or to treat cancer. The 
cancer may be a gastro-intestinal cancer, such as gastric cancer or colorectal cancer. 

The molecules are preferably administered in a pharmaceutically effective amount. Preferably 
the dose is between 1 ng/kg and 10 mg/kg. 

The nucleic acid molecules may be used to form DNA-based vaccines. From the published 
literature it is apparent that the development of protein, peptide and DNA-based vaccines 
can promote anti-tumour immune responses. In pre-clinical studies, such vaccines 
effectively induce a delayed-type hypersensitivity (DTH) response, cytotoxic T-lymphocyte 
(CTL) activity effective in causing the destruction (death by lysis or apoptosis) of the 
cancer cell, and the induction of protective or therapeutic immunity. In clinical trials, 
peptide-based vaccines have been shown to promote these immune responses in patients 
and in some instances cause the regression of secondary malignant disease. Antigens 
expressed in prostate cancer (or other types of cancers) but not in normal tissue (or only 
weakly expressed in normal tissue compared to cancer tissue) will allow us to assess their 
efficacy in the treatment of cancer by immunotherapy. Polypeptides derived from the 
tumour antigen may be administered with or without immunological adjuvant to promote 
T-cell responses and induce prophylactic and therapeutic immunity. DNA-based vaccines 
preferably consist of part or all of the genetic sequence of the tumour antigen inserted into 
an appropriate expression vector which, when injected (for example via the intramuscular, 
subcutaneous or intradermal route), causes the production of protein and subsequently 
activates the immune system. An alternative approach to therapy is to use 
antigen-presenting cells (for example, dendritic cells, DCs) either mixed with or pulsed with 
protein or peptides from the tumour antigen, or to transfect DCs with the expression plasmid 
(preferably inserted into a viral vector which would infect cells and deliver the gene into 
the cell), allowing the expression of protein and the presentation of appropriate peptide 
sequences to T-lymphocytes. 

Accordingly, the invention provides a nucleic acid molecule according to the invention in 
combination with a pharmaceutically-acceptable carrier. 

A further aspect of the invention provides a method of prophylaxis or treatment of a cancer 
such as a gastro-intestinal cancer comprising the administration to a patient of a nucleic 
acid molecule according to the invention. 

The polypeptide molecules according to the invention may be used to produce vaccines to 
vaccinate against a cancer, such as a gastro-intestinal cancer. 

Accordingly, the invention provides a polypeptide according to the invention in 
combination with a pharmaceutically acceptable carrier. 

The invention further provides use of a polypeptide according to the invention in a 
prophylaxis or treatment of a cancer such as a gastro-intestinal cancer. 



Methods of prophylaxis or treating a cancer, such as a gastro-intestinal cancer, by 
administering a protein or peptide according to the invention to a patient, are also provided. 

Vaccines comprising nucleic acids and/or polypeptides according to the invention are also 
provided. 

The polypeptides of the invention may be used to raise antibodies. In order to produce 
antibodies to tumour-associated antigens, procedures may be used to produce polyclonal 
antiserum (by injecting protein or peptide material into a suitable host) or monoclonal 
antibodies (raised using hybridoma technology). In addition, phage display antibodies 
may be produced; this offers an alternative procedure to conventional hybridoma 
methodology. Having raised antibodies which may be of value in detecting tumour antigen 
in tissues or cells isolated from tissue or blood, their usefulness as therapeutic reagents 
could be assessed. Antibodies identified for their specific reactivity with tumour antigen 
may be conjugated either to drugs or to radioisotopes. Upon injection it is anticipated that 
these antibodies localise at the site of the tumour and promote the death of tumour cells 
through the release of drugs or the conversion of pro-drug to an active metabolite. 
Alternatively, a lethal effect may be delivered by the use of antibodies conjugated to 
radioisotopes. In the detection of secondary/residual disease, antibody tagged with 
radioisotope could be used, allowing tumour to be localised and monitored during the 
course of therapy. 

The term "antibody" includes intact molecules as well as fragments such as Fab, F(ab')2 and 
Fv. 

The invention accordingly provides a method of treating a gastro-intestinal cancer by the 
use of one or more antibodies raised against a polypeptide of the invention. 

The cancer-associated proteins identified may form targets for therapy. 

The invention also provides nucleic acid probes capable of binding sequences of the 
invention under high stringency conditions. These may have sequences complementary to 
the sequences of the invention and may be used to detect mutations identified by the 
inventors. Such probes may be labelled by techniques known in the art, e.g. with 
radioactive or fluorescent labels. 

Preferably the gastro-intestinal cancer which is detected, assayed for, monitored, treated or 
targeted for prophylaxis, is a gastric cancer or a colorectal cancer. Most preferably, the 
cancer is a gastric carcinoma or a colonic carcinoma, more preferably a gastric 
adenocarcinoma or a colonic adenocarcinoma. 

The invention will now be described by reference to the following figures and examples: 

Figure 1 

(A) Schematic representation of TACC1-A exon composition and functional domains of 
the protein. Putative coiled-coil domain and nuclear localization signals (NLS) were 
predicted using sequence analysis tools at the SEREX web-site 
(http://www-ludwig.unil.ch/SEREX). 

(B) The 5' region of the TACC1 gene and the mRNA variants identified by 5' RLM-RACE. 
Exon-intron composition of TACC1 was determined by comparing the cDNA sequences 
with the working draft of the human genome. The complete 5' end sequences of the TACC1-F 
and -E variants are not known. Potential translation initiation codons are marked with an 
asterisk and primers for expression analysis are indicated by arrows. 

(C) Expression of the identified TACC1 mRNA variants in normal tissues and 4 
specimens of gastric cancer (T) and adjacent normal tissues (N) analysed by RT-PCR. 
Amplification of GAPDH and TACC-CCD (coiled-coil domain, exons 8-11) was 
determined to be within the linear phase, thus allowing comparison of mRNA levels. 

Figure 2. Expression of AD034 mRNA in normal tissues and autologous tumour (Col T) 
analysed by RT-PCR. GAPDH was amplified as an internal control and demonstrates the 
equal amounts of mRNA used for RT-PCR. 

Figure 3. An example of comparison of AD034 mRNA levels between cancerous and 
adjacent non-cancerous tissues by RT-PCR. Cycling conditions were optimised so that the 
RT-PCR products were analysed when amplification was within the linear phase. Ethidium 
bromide stained gels were scanned on a digital gel documentation system, intensities of 
bands were calculated and relative expression coefficients were determined using standard 
curves of amplification, and expression of each target gene was normalised to that of 
β-actin and GAPDH. In the example shown, a 4.8-fold increase (the mean value of two 
independent experiments) of AD034 in cancerous tissues was observed (normalised to 
β-actin). 

Technique used to identify genes encoding tumour antigens (SEREX technique) 

The technique for the expression of cDNA libraries from human tissue (moderately 
differentiated, ulcerated gastric adenocarcinoma and moderately differentiated colon 
adenocarcinoma) is described, and was performed according to published methodology 
(Sahin et al., Proc. Natl. Acad. Sci. 92, 11810-11813, 1995). 

SEREX has been used to analyze gene expression in tumour tissues from human 
melanoma, renal cell cancer, astrocytoma, oesophageal squamous cell carcinoma, colon 
cancer, lung cancer and Hodgkin's disease. Sequence analysis revealed that several 
different antigens, including HOM-MEL-40, HOM-HD-397, HOM-RCC-1.14, 
NY-ESO-1, NY-LU-12, NY-CO-13 and MAGE genes, were expressed in these malignancies, 
demonstrating that several human tumour types express multiple antigens capable of 
eliciting an immune response in the autologous host. This represents an alternative and 
more efficient approach to identify tumour markers, and offers distinct advantages over 
previously used techniques: 

1) the use of fresh tumour specimens to produce the cDNA libraries obviates 
the need to culture tumour cells in vitro and therefore circumvents artefacts, such 
as loss or neo-antigen expression and genetic and phenotypic diversity generated 
by extended culture; 

2) the analysis is restricted to antigen-encoding genes expressed by the tumour 
in vivo; 

3) using cDNA expression cloning, the serological analysis (in contrast to 
autologous typing) is not restricted to cell surface antigens, but covers a more 
extensive repertoire of cancer-associated proteins (cytosolic, nuclear, membrane, 
etc.); 

4) in contrast to techniques using monoclonal antibodies, SEREX uses 
poly-specific sera to scrutinise single antigens that are highly enriched in lytic 
bacterial plaques, allowing the efficient molecular identification of antigens 
following sequencing of the cDNA. Subsequently the tissue-expression spectrum 
of the antigen can be determined by the analysis of the mRNA expression patterns 
using, for example, northern blotting and reverse transcription-PCR (RT-PCR), on 
fresh normal and malignant (autologous and allogeneic) tissues. Likewise, the 
prevalence of antibody in cohorts of cancer patients and normal controls can be 
determined. 

TACC1-D identification 

cDNA clone Ga55 encoding TACC1 was isolated from a gastric cancer cDNA expression 
library by immunoscreening with autologous patient's serum using SEREX. This clone 
reacted exclusively with the patient's serum but not with sera from healthy individuals 
(n=35). The reactivity of autologous serum to TACC1 protein was also confirmed by 
Western blot analysis using a recombinantly expressed TACC1 fragment. Comparison of 
Ga55 cDNA (GenBank Accession number AY039239) with the previously published 
TACC1 sequence (AF049910) showed that Ga55 represents a TACC1 splice variant 
generated by inclusion of an alternative 36-bp exon and that the clone contains a partial cDNA 
sequence truncated at both 5' and 3' ends. Additionally, alignment of corresponding ESTs 
indicated that several other 5' variants of the transcript may be generated by alternative 
splicing. In order to analyse the exon composition of TACC1 mRNA 5' variants expressed 
in gastric cancer tissue and to determine the transcription start sites of these mRNAs the 
inventors performed RNA-Ligase-Mediated Rapid Amplification of cDNA Ends 
(RLM-RACE) using a FirstChoice™ RLM-RACE kit (Ambion) according to the 
manufacturer's protocol. 

10 µg of total RNA was isolated from gastric cancer tissues and treated with Calf Intestinal 
Phosphatase to remove 5'-phosphates from uncapped RNAs; the cap structure was then 
removed from full-length mRNA by Tobacco Acid Pyrophosphatase (TAP) and RNA 
adapters were ligated to mRNA molecules containing a 5'-phosphate. A random-primed 
reverse transcription and nested PCR with gene-specific and adapter-specific primers were 
performed. 

TACC1-D 

forward primer 5'-ccaagttctgcgccatggg-3' 
reverse primer 5'-aatttcacttgttcagtagtc-3' 

AD034 

forward primer 5'-cttatctctcaaaggccatgg-3' 
reverse primer 5'-gattttctaggagtgcaggg-3' 
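
For orientation, the basic properties of the primers listed above can be tabulated with a short script. This is only an illustrative sketch: the helper names are hypothetical, and the Wallace rule (2 °C per A/T, 4 °C per G/C) is a generic rule of thumb assumed here for a rough melting temperature; the specification does not state how the primers were designed or evaluated.

```python
# Illustrative sketch only. Helper names are hypothetical and the Wallace
# rule is an assumed rule of thumb, not a method taken from the specification.
PRIMERS = {
    "TACC1-D forward": "ccaagttctgcgccatggg",
    "TACC1-D reverse": "aatttcacttgttcagtagtc",
    "AD034 forward": "cttatctctcaaaggccatgg",
    "AD034 reverse": "gattttctaggagtgcaggg",
}


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a primer."""
    s = seq.upper()
    return (s.count("G") + s.count("C")) / len(s)


def wallace_tm(seq: str) -> int:
    """Rough melting temperature in deg C: 2 per A/T plus 4 per G/C."""
    s = seq.upper()
    at = s.count("A") + s.count("T")
    gc = s.count("G") + s.count("C")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.2f}, "
              f"approx. Tm {wallace_tm(seq)} C")
```

The estimate ignores salt and primer concentration, so it should be read only as a quick consistency check on primer pairs.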

The RNA sample, which had not been treated with TAP, was carried through the adapter 
ligation and RT-PCR as a negative control to demonstrate that the RLM-RACE products 
are generated by amplification of the 5' ends of full-length (decapped) RNA. Two bands 
of approximately 240-bp and 280-bp were detected when gene-specific primers located in 
exon 4 were used. These PCR products were cloned using the InsT/Aclone™ PCR Product 
Cloning Kit (Fermentas, Lithuania) and at least 10 plasmid clones containing each PCR 
product were sequenced on an ABI PRISM 310 automatic sequencer (Applied Biosystems). 
Comparison of the obtained sequences to the published TACC1 mRNA sequence (here 
designated as TACC1-A) and to the working draft of the human genome 
(www.ncbi.nlm.nih.gov) showed that these RLM-RACE products represent three novel 
TACC1 mRNA variants, designated TACC1-B, TACC1-C and TACC1-D (Fig. 1B). The 
first exons of these transcripts were not present in the published TACC1-A mRNA, but 
comparison with the genomic sequence (NT_008251) showed that exon 1a is located 53.65 
Kb and exon 1b 82.3 Kb upstream from the first exon of TACC1-A, suggesting that these 
transcript variants are under the control of different promoters. The transcription start site 
in exon 1b seems to be fixed as no differences among individual clones were detected. In 
contrast, the start site in exon 1a is scattered within a 100-bp region. No transcript variants 
corresponding to the clone Ga55 and the published TACC1-A sequence were detected in 
RLM-RACE analyses, likely reflecting an advantage for more abundant and/or shorter 
mRNA species in this PCR-based technique. 



The inventors then designed a set of isoform-specific primers to analyse the expression of 
TACC1 isoforms in normal and cancerous tissues. The sequences of the primers are shown 
in Table 1 and their location is indicated in Fig. 1B. 

Table 1 

Primers used for amplification of TACC1 transcript variants and controls 

Isoform/gene       Primer sequence (5'-3')       Number of cycles   Size of product (bp) 

TACC1-A        F   AGGAGGAGGATTCGCAAGC           35                 387 
               R   TTGTTCCGAGGACTGCCGAG 
TACC1-B        F   CCTCGCCGAAGAGGAGTGG           37                 252 
               R   TGGTAGACACAGGAACATTGG 
TACC1-C        F   CCACGGAGACCGCGAGTG            36                 252 
               R   TGGTAGACACAGGAACATTGG 
TACC1-D        F   CCAAGTTCTGCGCCATGGG           38                 112 
               R   AATTTCACTTGTTCAGTAGTC 
TACC1-E        F   GAGAGATGCGAAATCAGCG           35                 432 
               R   TTGTTCCGAGGACTGCCGAG 
TACC1-F        F   CTTTGACGAATCCATGGATCC         31                 129 
               R   AATTTCACTTGTTCAGTAGTC 
TACC-CCD       F   AAATACGAAGAGACCCGGC           28                 349 
               R   TGTCCAGTTTCTCTTCTGCG 
GAPDH          F   GTCATCCCTGAGCTAGACGG          25                 356 
               R   GGGTCTTACTCCTTGGAGGC 

Location of primers is indicated by arrows in Fig. 1B. TACC-CCD - region of TACC 
encoding coiled-coil domain, F - forward primer, R - reverse primer. 
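
Table 1 pairs isoform-specific forward primers with a small number of shared reverse primers. The following sketch, with hypothetical names, transcribes the table into a dictionary and groups the isoforms by reverse primer to make that sharing explicit. It is a reading aid, not part of the described method.

```python
# Illustrative sketch: Table 1 transcribed as data. The variable and function
# names are hypothetical; the sequences, cycle numbers and product sizes are
# those given in Table 1.
from collections import defaultdict

# isoform/gene -> (forward primer, reverse primer, cycles, product size in bp)
TABLE_1 = {
    "TACC1-A": ("AGGAGGAGGATTCGCAAGC", "TTGTTCCGAGGACTGCCGAG", 35, 387),
    "TACC1-B": ("CCTCGCCGAAGAGGAGTGG", "TGGTAGACACAGGAACATTGG", 37, 252),
    "TACC1-C": ("CCACGGAGACCGCGAGTG", "TGGTAGACACAGGAACATTGG", 36, 252),
    "TACC1-D": ("CCAAGTTCTGCGCCATGGG", "AATTTCACTTGTTCAGTAGTC", 38, 112),
    "TACC1-E": ("GAGAGATGCGAAATCAGCG", "TTGTTCCGAGGACTGCCGAG", 35, 432),
    "TACC1-F": ("CTTTGACGAATCCATGGATCC", "AATTTCACTTGTTCAGTAGTC", 31, 129),
    "TACC-CCD": ("AAATACGAAGAGACCCGGC", "TGTCCAGTTTCTCTTCTGCG", 28, 349),
    "GAPDH": ("GTCATCCCTGAGCTAGACGG", "GGGTCTTACTCCTTGGAGGC", 25, 356),
}


def group_by_reverse_primer(table):
    """Map each reverse primer to the list of isoforms/genes that use it."""
    groups = defaultdict(list)
    for name, (_forward, reverse, _cycles, _size) in table.items():
        groups[reverse].append(name)
    return dict(groups)


if __name__ == "__main__":
    for reverse, names in group_by_reverse_primer(TABLE_1).items():
        print(reverse, "->", ", ".join(names))
```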



Initially when the primers used for amplification were located within exons 1b and 5, a 
1500-bp band was detected in addition to the expected 318-bp (TACC1-B) product. Direct 
sequencing of this RT-PCR product revealed one more TACC1 splice variant (designated 
TACC1-E); however, the complete 5' end sequence for this variant is not known. The 
mRNA expression of the isoforms was analysed in a panel of normal tissues (brain, liver, 
heart, kidney, lung, trachea (Clontech), spleen, colon, stomach, testis and ovary (Ambion)) 
and tumour and adjacent tissues of 10 patients diagnosed with gastric adenocarcinoma. 
Fragments of GAPDH and TACC1 coiled-coil domain (exons 8-11) were amplified as 
controls to demonstrate that equal amounts of total mRNA were used for analysis. Optimal 
cycling conditions (input cDNA and number of cycles) for the controls were determined so 
that the amount of PCR product was in linear relationship to the amount of input cDNA. 
Linearity of the amplification was confirmed by a series of PCRs with 1.5-fold dilutions of 
input cDNA. In analysis of the isoform expression, additional cycles of amplification were 
performed to increase the sensitivity of the assay, which may reduce the linearity of 
amplification in some cases. Transcript variants A, B, C and E were expressed in all 
normal tissues analysed and no significant differences between cancerous and adjacent 
tissues were observed. Of the normal tissues analysed, TACC1-F was strongly 
expressed only in brain and weakly detectable in lung and colon. TACC1-D was almost 
undetectable in any of the normal tissues, with only trace amounts detected in kidney and colon 
after 38 cycles of amplification. At the same cycling conditions relatively strong 
TACC1-D expression was observed in 5 out of 10 specimens of gastric cancer tissues 
while very faint signals were detected in two of the adjacent tissue samples. TACC1-F 
expression was detected in normal brain tissue and at a similar level in 6 specimens of 
gastric cancer; however, it was also detectable as a weak signal in most adjacent tissues. 
Analysis of differentially expressed isoforms and controls is shown in Figure 2B. The 
number of cycles required to yield a detectable product (shown in Table 1) is unlikely to 
represent the relative abundance of the isoforms due to variations in efficiency of the 
primers; therefore the inventors cannot estimate the ratio of the isoforms. Co-amplification of 
TACC1-A/E and F, and TACC1-C/D showed that both TACC1-F and D are less abundant 
in gastric cancer cells than TACC1-A/E and C, respectively (Fig. 2C). Despite the 
overexpression of TACC1-D and F variants in tumours the inventors did not observe 
significant differences in total TACC1 level (TACC-CCD) between cancerous and 
non-cancerous tissues of these patients. This indicates that regulation of mRNA splicing 
rather than expression level of TACC1 is altered in gastric tumours. Both TACC1-F and D 
contain exon 4a that is not included in any other transcript variant. Presumably the splice 
sites of the alternative exon are "weaker" and are not recognised by the splicing machinery 
in normal tissues except brain. The mechanism of altered splice site selection in cancer 
cells is not known, although it has been shown that mutations or sequence polymorphisms 
in splice regulatory sequences, changes in splicing factors and activation of particular 
signal transduction pathways may modulate the use of alternative splice sites (Philips A.V., 
et al., Cell Mol. Life Sci., Vol. 57, pages 235-249). Alterations in the splicing pattern or 
efficiency of several genes have been implicated in tumour progression (for example, 
CD44, WT-1, C-CAM1) and susceptibility to cancer (for example, BRCA1, CYP3A). 

Like mutations, altered splicing can serve as one of the mechanisms for the generation of 
protein diversity contributing to the selection of more aggressive tumour cells (Philips 
A.V., Supra and Cooper T.A., Am. J. Hum. Genet. (1997), Vol. 61, pages 259-266). Here 
the inventors show that the regulation of alternative splicing of TACC1 is perturbed in 
primary gastric tumours. Both of the differentially expressed isoforms can be exploited as 
biomarkers for gastric cancer and their prognostic significance is currently 
being investigated. Although the function of the TACC1 isoforms is not known, the 
inventors propose that aberrant expression of TACC1-D and F isoforms may 
contribute to centrosome malfunction. Various centrosome abnormalities, including 
atypical size, shape and increased number, are observed in most of the common human 
cancers but little is known about the underlying genetic alterations. Centrosome defects are 
known to lead to the formation of multipolar spindles and chromosome segregation errors; 
see, for example, Salisbury J.L., J. Mamm. Gland. Biol. Neoplasia (2001), Vol. 6, pages 
203-212; Sato N., et al., Cancer Genet. Cytogenet. (2002), Vol. 126, pages 13-19; 
Duensing S. & Munger K., Biochim. Biophys. Acta (2001), Vol. 1471, M81-M88; Marx 
J., Science (2001), Vol. 292, pages 426-429. 

The identified TACC1 isoforms differ in their N-terminal regions but share an identical 
coiled-coil domain. The coiled-coil domain interacts with microtubules by cooperating 
with another microtubule-associated protein (Msps in Drosophila) which stabilises 
centrosomal microtubules (Lee, et al., Supra). TACC1-A protein is distributed in the 
cytoplasm and nucleus in interphase but it concentrates at centrosomes and on 
microtubules during mitosis; the N-terminal domain appears to be required for proper 
subcellular distribution during the cell cycle. In fact, TACC1-A and TACC1-E contain 
two nuclear localisation signals which are absent in the four shortest splice variants. 

Experiments with Drosophila have shown that decreasing the level of D-TACC protein 
leads to the formation of abnormally short centrosomal microtubules and subsequently to 
severe mitotic defects (Gergely, et al., Supra). In contrast, overexpression of D-TACC 
leads to the formation of large, highly ordered protein aggregates around the centrosomes 
and an increase in the number and/or length of centrosomal microtubules (Lee, et al., 
Supra). When coiled-coil domains of human TACC proteins are overexpressed in HeLa 
cells, they form similar polymeric structures in the cytoplasm. Full-length TACC1-A also 
forms polymers, but they are less compacted and clustered around the nucleus (Gergely, et 
al., Supra). This shows that perturbations in TACC gene expression could contribute to 
mitotic defects and genetic instability. The inventors propose that deregulation of 
alternative splicing resulting in inappropriate expression of TACC1 isoforms in gastric 
cancer could result in the dysfunction of TACC1. It is possible that the formation of such 
protein aggregates might have served as an immunogenic stimulus in the cancer patients, 
resulting in the production of anti-TACC1 antibodies. In the study, the antibody response 
to TACC1 was restricted to the autologous patient but, interestingly, both TACC1 and 2 
have been detected by SEREX in gastric cancer by Y. Obata (SEREX database). If the 
B-cell response to TACC1 in patients is elicited by the formation of protein aggregates as a 
consequence of deregulated TACC1 expression, given the restricted expression of, e.g. 
TACC1-D, some of the isoforms are a target for vaccine-based immunotherapy. 

Furthermore, the functions of the isoforms are likely to differ, thus making 
them a target for compounds affecting their activity. 

TACC1-D is especially of interest because of its specific expression in relatively high 
amounts in gastro-intestinal cancers. 



AD034 Isolation 



Tissue specimens and patient sera 

Colorectal cancer tissue and the adjacent non-cancerous tissue specimens from 15 patients 
undergoing surgery at the Latvian Oncology Center were resected and frozen in 
nitrogen immediately after the surgery. Clinico-pathologic data, including histology, depth 
of invasion, lymph node and liver metastasis, Dukes' stage, etc., were obtained from the 
clinical records. In addition, serum samples were obtained from colon, stomach and breast 
cancer patients undergoing diagnostic procedures and from healthy volunteers. The study 
was approved by the Committee of Medical Ethics of Latvia and the tissue samples and sera 
were collected after the patients' informed consent was obtained. 

Isolation of total RNA and construction of cDNA library 

Total RNA was isolated from tumour and normal tissue samples, using Trizol reagent 
according to the manufacturer's protocol (Life Technologies, Inc.). A cDNA expression 
library was constructed from a tumour specimen of a moderately differentiated 
adenocarcinoma of colon. Poly(A)+ RNA was purified from total RNA using a Dynabeads 
mRNA Purification kit (Dynal AS, Norway) and cDNA was ligated into the lambda 
Uni-ZAP XR vector using a Gigapack III Gold cloning kit (Stratagene GmbH). After in vitro 
packaging, a library containing 10^ primary cDNA clones was obtained and amplified once 
prior to immunoscreening. 

Immunoscreening 

Immunoscreening of the cDNA library was performed as described by Sahin, et al., (1995) 
Supra. Briefly, E. coli XL1-Blue MRF' cells were transfected with the recombinant phages, 
plated at a density of approx. 5000 pfu/150-mm plate (NZCYM-IPTG-agar) and, following 
8 hr incubation at 37°C, transferred to nitrocellulose filters. In order to eliminate cDNA 
clones encoding human immunoglobulins, filters were pre-screened with AP-conjugated rabbit 
anti-human secondary antibody (Pierce, USA) prior to incubation with sera, and reactive 
plaques were detected with 5-bromo-4-chloro-3-indolyl-phosphate (BCIP)/nitroblue 
tetrazolium (NBT) and marked. Then filters were incubated with 1:250 diluted patient's 
serum, which had previously been preabsorbed with E. coli-phage lysate; serum-reactive 
clones were detected with AP-conjugated secondary antibody and visualised by incubating 
with BCIP/NBT. The reactive phage clones were subcloned to monoclonality and converted 
to pBluescript phagemids. To assess frequencies of antibody responses to the 
SEREX-defined antigens in allogeneic sera, E. coli were transfected directly on the gridded 
agar plate, by spotting 1 µl of monoclonal positive phage (20-30 pfu/µl) side by side with 
non-recombinant phages. "Phage arrays" were screened with 1:200 diluted allogeneic sera 
as described above, excluding the IgG pre-screening step. 

DNA sequencing and sequence analysis 

Phagemid DNA was purified using a QIAprep Spin Miniprep kit (QIAGEN GmbH), 
analysed by EcoRI/XhoI restriction enzyme digestion and clones representing different 
cDNA inserts were sequenced using a BigDye Terminator Cycle Sequencing Ready Reaction 
kit on an ABI PRISM 3100 genetic analyser (Applied Biosystems). Gene-specific primers 
were designed to obtain full insert sequences. Genes were identified by homology search 
through the GenBank data base (www.ncbi.nlm.nih.gov/BLAST). Chromosomal 
localisation and exon-intron organisation of the cDNAs were determined by comparison to 
the working draft of the human genome. Putative protein domains were predicted by 
scanning the sequences against PROSITE (www.expasy.org) and by using tools for 
sequence analysis at the SEREX web-site (www-ludwig.unil.ch/SEREX). 

Western blot analysis 

Immunoreactivity to the recombinant proteins in serum-reactive clones was confirmed by 
Western blot analysis. E. coli XL1-Blue cells were transformed with the recombinant 
pBluescript phagemids excised from the Uni-ZAP XR vector. The cells were grown in 
LB-medium with ampicillin to an OD of 0.4 at 540 nm and then transcription from the lacZ 
promoter was induced with 2 mM IPTG. Samples of the bacterial cultures were collected 
before induction and 3 and 5 after the protein expression was induced. The cells were 
lysed with 3x Laemmli buffer, lysates were separated by SDS-PAGE and blotted to Hybond 
C-extra filters (Amersham Biosciences). The filters were blocked with fat-free milk, 
incubated with the autologous patient serum and antigen-antibody complexes were detected 
with HRP-conjugated rabbit anti-human antibody using an ECL detection system 
(Amersham Biosciences). 

Comparative RT-PCR analysis 

The mRNA expression pattern of SEREX-defined antigens was analysed by RT-PCR using 
a panel of normal tissue RNA (whole brain, liver, heart, kidney, lung, trachea) (Clontech), 
(stomach, colon, spleen, testis, ovary) (Ambion), PBLs and a specimen of colon cancer of 
the autologous patient. Relative mRNA levels were compared between cancerous and 
adjacent non-cancerous tissues of 15 patients by comparative RT-PCR. The first-strand 
cDNA was synthesised from 4 µg of total RNA primed with oligo-dT(18) and random 
hexamer primers using a First-Strand cDNA Synthesis Kit (Fermentas, Lithuania). 
Gene-specific PCR primers located within different exons were designed to amplify cDNA 
fragments (250-350 bp in length) of AD034 genes and of GAPDH and β-actin as internal 
standard genes. One fiftieth of the RT mixture was amplified in a GeneAmp PCR System 2400 
thermal cycler (Perkin-Elmer Corp.) in a total reaction volume of 20 µl containing 10 
pmole of each primer, 200 µM dNTPs and 2 U of Taq polymerase (Fermentas, Lithuania). 
Optimisation of cycling conditions (amount of input cDNA and number of cycles) was 
performed as described by Toh, et al., Int. J. Cancer (1997), Vol. 72, page 459. 
Amplification of all target genes was performed simultaneously, at the same cycling 
conditions (45s at 94°C, 30s at 58°C, 45s at 72°C), except for the number of cycles, which 
differed for each target gene. The primer sequences, number of cycles 
used and length of PCR products are shown in Table 2. The quantity of RT-PCR products 
was determined densitometrically after scanning the ethidium bromide stained gel on a 
digital gel documentation and analysis system GDS8000 (Ultra-Violet Products Ltd., UK) 
and the intensities of bands were calculated using Gel Works software. Standard curves of 
amplification of each target gene were constructed from a series of PCRs with ten 1.5-fold 
dilutions of the colon cancer cDNA. Amounts of PCR products were linearly dependent 
on input cDNA over 10-fold dilutions of cDNA. The relative amounts of target mRNAs 
were normalised to GAPDH and β-actin. The obtained values in tumours (T) were 
compared to those in matched normal epithelium (N) and T/N ratios were calculated for 
each mRNA in each patient's tissue samples. Each reaction was performed in duplicate. 
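
The standard-curve step described above can be illustrated with a short sketch. It assumes, hypothetically, that band intensities are available as plain numbers; the function names and the synthetic intensities are illustrative only and are not taken from the Gel Works software or from the measured data. The sketch builds a ten-point 1.5-fold dilution series and checks linearity with an ordinary least-squares fit.

```python
# Illustrative sketch only: synthetic numbers, hypothetical function names.
# It shows one way to check that band intensity depends linearly on the
# relative amount of input cDNA across a 1.5-fold dilution series.


def dilution_series(start: float, factor: float, n: int) -> list:
    """Relative input amounts for n serial dilutions (first point undiluted)."""
    return [start / factor ** i for i in range(n)]


def linear_fit_r2(x, y):
    """Ordinary least-squares fit y = slope * x + intercept, with R squared."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ss_res = sum((b - (slope * a + intercept)) ** 2 for a, b in zip(x, y))
    ss_tot = sum((b - mean_y) ** 2 for b in y)
    return slope, intercept, 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    inputs = dilution_series(1.0, 1.5, 10)
    # Synthetic, perfectly linear intensities used purely for demonstration.
    intensities = [200.0 * amount + 5.0 for amount in inputs]
    slope, intercept, r2 = linear_fit_r2(inputs, intensities)
    print(f"slope={slope:.1f} intercept={intercept:.1f} R^2={r2:.4f}")
```

With real densitometry data, an R squared value well below 1 over the dilution range would suggest that the chosen cycle number lies outside the linear phase.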

5' RLM-RACE of Co23 (AD034) 

The full-length 5' end of the Co23 cDNA sequence was cloned from colon cancer tissues of the 
autologous patient using a FirstChoice™ RLM-RACE kit (Ambion) according to the 
manufacturer's protocol. Briefly, 10 µg of total RNA were treated with Calf Intestinal 
Phosphatase to remove 5'-phosphates from uncapped RNAs (degraded mRNA, rRNA, 
tRNA or DNA), then the cap structure was removed from the full-length mRNA by 
Tobacco Acid Pyrophosphatase and RNA adapters were ligated to mRNA molecules 
containing a 5'-phosphate. A random-primed RT-nested PCR with gene-specific and 
adapter-specific primers was performed, products were cloned using the InsT/Aclone™ PCR 
Product Cloning Kit (Fermentas, Lithuania) and multiple clones were sequenced. 

Table 2 

Primers used for expression analysis of SEREX-defined antigens 

Gene                   Primer sequences (5'-3')        No. of cycles   Size of product (bp) 

AD034              F   CTTATCTCTCAAAGGCCATGG           28              276 
                   R   GATTTTCTAGGAGTGCAGGG 
AD034* (ex.2-3)    F   ATGATGATGACTGGGACTGG            32              176/144 
                   R   GTAAGACCTTGTCTGCTGG 
β-actin            F   AGTGTGACGTGGACATCCG             20              351 
                   R   AATCTCATCTTGTTTTCTGCGC 
GAPDH              F   GTCATCCCTGAGCTAGACGG            25              356 
                   R   GGGTCTTACTCCTTGGAGGC 

* These sets of primers were used for analysis of expression of RHAMM and 
variants, respectively. 

F - forward primer, R - reverse primer. 

RESULTS 



Immunoscreening and identification of immunoreactive cDNA clones 

Fourteen serum-reactive cDNA clones were detected by immunoscreening of 8 x 10^ pfu 
from a colon cancer cDNA expression library with autologous patient's serum. The clones 
were purified, full-length sequences of their cDNA inserts were obtained and the genes 
were identified by homology search through the GenBank data base. 

mRNA expression of SEREX-defined antigens 

mRNA expression of SEREX-defined antigens was analysed by RT-PCR in normal tissues 
(brain, liver, heart, kidney, lung, trachea, spleen, colon, stomach, testis, ovary and PBLs) 
and in a specimen of colon cancer tissue of the autologous patient. Cycling conditions and 
the optimal number of cycles were chosen so that the PCR products were in the linear phase 
of amplification. GAPDH and β-actin were used as controls for RNA integrity and 
quantity. This allows the abundance of each mRNA in normal tissues to be assessed relative to 
the autologous colon cancer tissue. Co23 was expressed in testis, spleen, colon, stomach 
and colon cancer tissues (Figure 2). 

Comparison of mRNA levels in colon cancer and adjacent non-cancerous tissues 

To determine whether the antigens showing relatively high expression in the autologous 
tumour are overexpressed in other colorectal cancers, the inventors compared their relative 
mRNA levels between cancerous and paired adjacent tissue specimens of 15 patients with 
colorectal cancer by RT-PCR. The conditions for amplification of each target gene were 
optimised so that the amount of PCR product was in linear relationship to the amount of 
input cDNA, at least over a 10-fold dilution of input cDNA. GAPDH and β-actin were used 
as internal controls. An example of analysis is shown in Figure 3. Relative quantities of 
RT-PCR products were determined by densitometric analysis, the amounts of target 
cDNAs were normalised to that of β-actin or GAPDH and tumour/normal ratios were 
calculated. Ratios ^ (the mean values of two independent experiments) were considered 
to represent significant overexpression. The inventors observed a 2.0-4.8-fold increase of Co23 
(AD034) in 4 specimens of colon cancer when compared to the adjacent tissues 
(normalised to β-actin). 
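
The normalisation and tumour/normal ratio calculation described in this section can be sketched as follows. The function names and the band intensities are hypothetical and chosen only for illustration; they are not the measured values. The sketch divides each target intensity by the reference-gene intensity from the same sample, takes the tumour/normal ratio, and averages two independent experiments.

```python
# Illustrative sketch only: hypothetical function names and made-up band
# intensities. It mirrors the described calculation: normalise the target to
# a reference gene (e.g. beta-actin), then take the tumour/normal ratio.
from statistics import mean


def normalised(target: float, reference: float) -> float:
    """Target band intensity relative to the reference gene in one sample."""
    return target / reference


def tn_ratio(t_target, t_reference, n_target, n_reference) -> float:
    """Tumour/normal ratio of reference-normalised target expression."""
    return normalised(t_target, t_reference) / normalised(n_target, n_reference)


def mean_tn_ratio(experiments) -> float:
    """Mean T/N ratio over independent experiments.

    Each experiment is a tuple:
    (tumour target, tumour reference, normal target, normal reference).
    """
    return mean(tn_ratio(*experiment) for experiment in experiments)


if __name__ == "__main__":
    # Two hypothetical independent experiments for one patient.
    experiments = [(400.0, 100.0, 100.0, 100.0), (560.0, 100.0, 100.0, 100.0)]
    print(f"mean T/N ratio: {mean_tn_ratio(experiments):.1f}")
```

Normalising before taking the ratio means that differences in RNA input between the tumour and adjacent tissue samples cancel out, provided the reference gene itself is not differentially expressed.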

Cloning of AD034 mRNA 5' variants 

Clone Co23 contains a partial cDNA sequence encoding hypothetical protein AD034. The 
longest ORF encodes a 561-amino-acid protein of approximately 64.6 kDa. Comparison of 
the predicted amino acid sequence with PROSITE and Pfam databases revealed a similarity 
to the RIO1/ZK632.3/MJ0444 protein family (aa 186-380), the tyrosine kinase active-site 
signature (aa 313-325) and the aspartic acid and lysine-rich regions (aa 10-66 and 514-561, 
respectively). To determine the transcription start site and to search for possible sequence 
variations in the 5' region of AD034 that was absent in clone Co23, 5' RLM-RACE 
analysis was performed using total RNA from tumour tissues of the autologous patient. The 5' 
ends of the sequenced RLM-RACE clones differed by 154-bp, indicating that the 
transcription start site of AD034 is scattered within this region. The longest RACE clone 
extended the AD034 mRNA sequence by 37-bp; however, no additional translation 
initiation site was found. Of the 8 clones sequenced, three contained an insertion of 32-bp 
(submitted to GenBank, AY094356). Alignment with the genomic sequence (NT_023412) 
showed that the inserted 32-bp are derived from the intronic sequence flanking exon 3 and 
presumably are included in the mRNA by use of a cryptic splice site. The insertion shifts the 
reading frame and introduces a stop codon, resulting in a truncated ORF of 91 aa. RT-PCR 
analyses of expression of the splice variants showed that the transcript containing the 32-bp 
sequence is just a minor mRNA variant and is detectable in all normal tissues where 
AD034 is expressed, and no significant differences in the ratios of either of the splice 
variants were observed between colorectal tumours and adjacent normal tissues. 
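
Why a 32-bp insertion truncates the ORF can be shown with a toy example. The sequences below are invented for illustration and are not the AD034 sequence; the function names are hypothetical. The sketch uses the standard genetic code to show that an insertion whose length is not a multiple of three shifts the reading frame, which here brings a stop codon into frame.

```python
# Illustrative sketch only: toy sequences, not the AD034 cDNA. Standard
# genetic code, built from the conventional TCAG ordering of codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def shifts_reading_frame(insert_length: int) -> bool:
    """An insertion shifts the frame unless its length is a multiple of 3."""
    return insert_length % 3 != 0


def translate_orf(dna: str) -> str:
    """Translate from the first base, codon by codon, up to the first stop."""
    dna = dna.upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Toy ORF: Met, three Ala, stop. Inserting two bases after the start codon
# shifts the frame and creates an immediate in-frame stop codon (TAG).
TOY_ORF = "ATG" + "GCT" * 3 + "TAA"
TOY_VARIANT = TOY_ORF[:3] + "TA" + TOY_ORF[3:]

if __name__ == "__main__":
    print("32-bp insertion shifts frame:", shifts_reading_frame(32))
    print("toy ORF:", translate_orf(TOY_ORF))
    print("toy variant:", translate_orf(TOY_VARIANT))
```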

DISCUSSION 



Clone Co23 encodes a hypothetical protein AD034. Analysis of the predicted amino acid 
sequence revealed a tyrosine kinase motif and a similarity to the RIO1/ZK632.3/MJ0444 
family of evolutionarily related uncharacterised proteins. The inventors observed a relative 
upregulation of AD034 mRNA expression in several colon cancer cases; however, the 
significance of AD034 expression in cancer development is unknown. The inventors also 
cloned a novel AD034 transcript variant, generated by use of a cryptic splice site. 
Translation of this transcript results in a truncated protein of 91 amino acids. However, 
RT-PCR analysis showed that the novel transcript variant represents less than 10% of the 
AD034 mRNA and is also detectable in several normal adult tissues, including normal 
colon, indicating that expression of this splice variant is not likely to be associated with 
immune recognition of AD034 in cancer patients. The biological role of this 
isoform remains to be investigated. 

Claims 



1. The use of an isolated nucleic acid molecule comprising a sequence selected from 
SEQ.ID.1, SEQ.ID.2 and SEQ.ID.3 to detect or monitor cancer. 

2. The use of a nucleic acid probe which is capable of hybridising under high 
stringency conditions to an isolated nucleic acid molecule comprising a sequence selected 
from SEQ.ID.1, SEQ.ID.2 and SEQ.ID.3 to detect or monitor cancer. 

3. A method of detecting or monitoring cancer comprising the step of detecting or 
monitoring elevated levels of a nucleic acid molecule comprising a sequence selected from 
SEQ.ID.1, SEQ.ID.2 and SEQ.ID.3 in a sample from a patient. 

4. A method of detecting or monitoring cancer comprising the use of a nucleic acid 
molecule or probe according to claim 1 or claim 2 in combination with a reverse 
transcription polymerase chain reaction (RT-PCR). 

5. A method of detecting or monitoring cancer comprising detecting or monitoring 
elevated levels of a protein or peptide comprising an amino acid sequence encoded by a 
nucleic acid sequence selected from SEQ.ID.1, SEQ.ID.2 and SEQ.ID.3. 

6. A method according to claim 5 comprising the use of an antibody selective for a 
protein or peptide as defined in claim 5 to detect the protein or peptide. 

7. A method according to claim 6 comprising the use of an Enzyme-linked
Immunosorbent Assay (ELISA).

8. Use or method according to any one of claims 1 to 7, wherein the cancer is a
gastro-intestinal cancer. 

9. A kit for use with a method according to any one of claims 3-8 comprising a nucleic
acid, protein or peptide, or an antibody as defined in any one of claims 3-8. 

10. A method of prophylaxis or treatment of cancer comprising administering to a
patient a pharmaceutically effective amount of a nucleic acid molecule comprising a nucleic
acid sequence selected from SEQ.ID.1, SEQ.ID.2 and SEQ.ID.3 or a pharmaceutically
effective fragment thereof.

11. A method of prophylaxis or treatment of cancer comprising administering to a
patient a pharmaceutically effective amount of a nucleic acid molecule hybridisable under
high stringency conditions to a nucleic acid molecule comprising a nucleic acid sequence
selected from SEQ.ID.1, SEQ.ID.2 and SEQ.ID.3 or a pharmaceutically effective fragment
thereof.

12. A method of prophylaxis or treatment of cancer comprising administering to a 
patient a pharmaceutically effective amount of a protein or peptide comprising an amino 
acid sequence encoded by a nucleic acid sequence selected from SEQ.ID.1, SEQ.ID.2 and
SEQ.ID.3 or a pharmaceutically effective fragment thereof.

13. A method of prophylaxis or treatment of cancer comprising the step of 
administering to a patient a pharmaceutically effective amount of an antibody capable of 
specifically binding a protein or peptide comprising an amino acid sequence encoded by a 
nucleic acid sequence selected from SEQ.ID.1, SEQ.ID.2 and SEQ.ID.3. 

14. A method according to any one of claims 10 to 11, wherein the cancer is a
gastro-intestinal cancer. 

15. A vaccine comprising a nucleic acid molecule having a nucleic acid sequence 
selected from SEQ.ID.1, SEQ.ID.2 and SEQ.ID.3 or a pharmaceutically effective fragment 
thereof and a pharmaceutically acceptable carrier.

16. A vaccine comprising a protein or peptide comprising an amino acid sequence 
encoded by a nucleic acid sequence selected from SEQ.ID.1, SEQ.ID.2 and SEQ.ID.3 or a
pharmaceutically effective fragment thereof, and a pharmaceutically acceptable carrier. 

17. An isolated mammalian nucleic acid molecule which codes for the following amino
acid sequence:

MSRWPGQFDDADSSDSENRDLKTVKEKDDILFEDLQDNVNENG 

EGEIEDEEEEGYDDDDDDWDWDEGVGKLAKGYVWNGGSNPQANRQTSDSSSAKMSTPA 

DKVLRKFENKINLDKLNVTDSVINKVTEKSRQKEADMYRIKDKADRATVEQVLDPRTR 

MILFKMLTRGIITEINGCISTGKEANVYHASTANGESRAIKIYKTSILVFKDRDKYVS 

GEFRFRHGYCKGNPRKMVKTWAEKEMRNLIRLNTAEIPCPEPIMLRSHVLVMSFIGKD

DMPAPLLKNVQLSESKARELYLQVIQYMRRMYQDARLVHADLSEFNMLYHGGGVYIID

VSQSVEHDHPHALEFLRKDCANVNDFFMRHSVAVMTVRELFEFVTDPSITHENMDAYL 

SKAMEIASQRTKEERSSQDHVDEEVFKRAYIPRTLNEVKNYE^DMDIIMKLKEEDMAM 

NAQQDNILYQTVTGLKKDLSGVQKVPALLENQVEERTCSDSEDIGSSECSDTDSEEQG 

DHARPKKHTTDPDIDKKERKKMVKEAQREKRKNKIPKHVKKRKEKTAKTKKGK 



or a variant of a fragment thereof which encodes a prostate-associated antigen which is
expressed in higher than normal concentrations in prostate cancer cells.

18. A vector comprising an isolated mammalian nucleic acid molecule according to
claim 17. 

19. A nucleic acid molecule comprising at least 15 nucleotides, the nucleic acid 
molecule being capable of hybridising to a molecule according to claim 17 under high 
stringency conditions. 

20. An isolated protein or peptide comprising an amino acid sequence obtainable from 
a nucleic acid molecule according to claim 17, 18 or 19. 

ABSTRACT



Gastric and Colon Cancer-associated Antigens 

The application discloses cancer-associated genes and their products, especially those 
identifiable by SEREX. The genes and products are used to identify, track and treat cancer.
Preferably the cancer is a gastro-intestinal cancer. 